Paraphrasing Swedish Compound Nouns in Machine Translation
نویسندگان
چکیده
This paper examines the effect of paraphrasing noun-noun compounds in statistical machine translation from Swedish to English. The paraphrases are meant to elicit the underlying relationship that holds between the compounding nouns, with the use of prepositional and verb phrases. Though some types of noun-noun compounds are too lexicalized, or have some other qualities that make them unsuitable for paraphrasing, a set of roughly two hundred noun-noun compounds are identified, split and paraphrased to be used in experiments on statistical machine translation. The results indicate a slight improvement in translation of the paraphrased compound nouns, with a minor loss in overall BLEU score.
منابع مشابه
Paraphrasing of Swedish Compound Nouns
The goal for this project is to examine and evaluate the effect of paraphrasing noun-noun compounds, with the aim of improving machine translation. The paraphrases will elicit the underlying relationship that holds between the compounding nouns, with the use of prepositional and verb phrases. Though some types of noun-noun compounds are too lexicalized, or have some other qualities that make th...
متن کاملExpanding Parallel Resources for Medium-Density Languages for Free
We discuss a previously proposed method for augmenting parallel corpora of limited size for the purposes of machine translation through monolingual paraphrasing of the source language. We develop a three-stage shallow paraphrasing procedure to be applied to the Swedish-Bulgarian language pair for which limited parallel resources exist. The source language exhibits specifics not typical of high-...
متن کاملNoun-Noun Compound Machine Translation A Feasibility Study On Shallow Processing
The translation of compound nouns is a major issue in machine translation due to their frequency of occurrence and high productivity. Various shallow methods have been proposed to translate compound nouns, notable amongst which are memory-based machine translation and word-to-word compositional machine translation. This paper describes the results of a feasibility study on the ability of these ...
متن کاملProcessing of Swedish Compounds for Phrase-Based Statistical Machine Translation
We investigated the effects of processing Swedish compounds for phrase-based SMT between Swedish and English. Compounds were split in a pre-processing step using an unsupervised empirical method. After translation into Swedish, compounds were merged, using a novel merging algorithm. We investigated two ways of handling compound parts, by marking them as compound parts or by normalizing them to ...
متن کاملSuper-function Based Machine Translation System for Business User
In today’s increasingly networked world, there is an increased need for language translation. Attempts of language translation are almost as old as computer themselves. Machine Translation (MT) is the attempt to automate all, or part of the process of translating between human languages and is one of the oldest large-scale applications of computer science. Developing a system that accurately pr...
متن کامل